The influence of system calls and interrupts on the performance of a PC cluster using a remote DMA communication primitive
Abstract
This paper presents an efficient MPI implementation on a cluster of PCs using a remote DMA communication primitive. The experiments were run on the MultiPC (MPC) parallel computer, which consists of standard PCs interconnected through a gigabit High Speed Link (HSL) network. The paper focuses on the communication software layers over the HSL network and describes two implementations of MPI. The first uses hardware interrupts to signal network events and places system calls in the communication critical path. The second is based entirely on user-level communication. Measurements show a latency of 15 µs on a Pentium II-350 with this optimized implementation. A quantitative analysis shows how system calls and interrupts affect communication time. To assess performance in a realistic environment, experiments were run on the Gauss elimination method using a parallel implementation of a local numerical analysis computational package (CADNA).
Publication date: 2002